Search CORE

76 research outputs found

MeLinDa: an interlinking framework for the web of data

Author: Euzenat Jérôme
Scharffe François
Publication venue
Publication date: 21/07/2011
Field of study

The web of data consists of data published on the web in such a way that they can be interpreted and connected together. It is thus critical to establish links between these data, both for the web of data and for the semantic web that it contributes to feed. We consider here the various techniques developed for that purpose and analyze their commonalities and differences. We propose a general framework and show how the diverse techniques fit in the framework. From this framework we consider the relation between data interlinking and ontology matching. Although, they can be considered similar at a certain level (they both relate formal entities), they serve different purposes, but would find a mutual benefit at collaborating. We thus present a scheme under which it is possible for data linking tools to take advantage of ontology alignments.Comment: N° RR-7691 (2011

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL Descartes

HAL-Rennes 1

SPARQL++ for mapping between RDF vocabularies

Author: Axel Polleres
François Scharffe
Roman Schindlauer
Publication venue: Vilamoura
Publication date: 01/01/2007
Field of study

Abstract. Lightweight ontologies in the form of RDF vocabularies such as SIOC, FOAF, vCard, etc. are increasingly being used and exported by “serious ” applications recently. Such vocabularies, together with query languages like SPARQL also allow to syndicate resulting RDF data from arbitrary Web sources and open the path to finally bringing the Semantic Web to operation mode. Considering, however, that many of the promoted lightweight ontologies overlap, the lack of suitable standards to describe these overlaps in a declarative fashion becomes evident. In this paper we argue that one does not necessarily need to delve into the huge body of research on ontology mapping for a solution, but SPARQL itself might — with extensions such as external functions and aggregates — serve as a basis for declaratively describing ontology mappings. We provide the semantic foundations and a path towards implementation for such a mapping language by means of a translation to Datalog with external predicates

CiteSeerX

Méthodes et outils pour lier le web des données

Author: Euzenat Jérôme
Scharffe François
Publication venue: No commercial editor.
Publication date: 19/01/2010
Field of study

scharffe2010aNational audienceLe web des données consiste à publier des données sur le web de telle sorte qu'elles puissent être interprétées et connectées entre elles. Il est donc vital d'établir les liens entre ces données à la fois pour le web des données et pour le web sémantique qu'il contribue à nourrir. Nous proposons un cadre général dans lequel s'inscrivent les différentes techniques utilisées pour établir ces liens et nous montrons comment elles s'y insèrent. Nous proposons ensuite une architecture permettant d'associer les différents systèmes de liage de données et de les faire collaborer avec les systèmes développés pour la mise en correspondance d'ontologies qui présente de nombreux points communs avec la découverte de liens

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-Rennes 1

Détection de clefs pour l'interconnexion et le nettoyage de jeux de données

Author: David Jérôme
Scharffe François
Publication venue: HAL CCSD
Publication date: 25/06/2012
Field of study

International audienceCet article propose une méthode d'analyse de jeux de données du Web publiés en RDF basée sur les dépendances de clefs. Ce type particulier de dépendances fonctionnelles, largement étudié dans la théorie des bases de données, permet d'évaluer si un ensemble de propriétés constitue une clef pour l'ensemble de données considéré. Si c'est le cas, il n'y aura alors pas deux instances possédant les mêmes valeurs pour ces propriétés. Après avoir donné les définitions nécessaires, nous proposons un algorithme de détection des clefs minimales sur un jeu de données RDF. Nous utilisons ensuite cet algorithme pour détecter les clefs de plusieurs jeux de données publiées sur le Web et appliquons notre approche pour deux applications : (1) réduire le nombre de propriétés à comparer dans le but de détecter des ressources identiques entre deux jeux de données, et (2) détecter des erreurs à l'intérieur d'un jeu de données

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL Descartes

SPARQL-DL queries for antipattern detection

Author: Corcho Oscar
Roussey Catherine
Scharffe François
Šváb-Zamazal Ondřej
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2012
Field of study

Ontology antipatterns are structures that reflect ontology modelling problems, they lead to inconsistencies, bad reasoning performance or bad formalisation of domain knowledge. Antipatterns normally appear in ontologies developed by those who are not experts in ontology engineering. Based on our experience in ontology design, we have created a catalogue of such antipatterns in the past, and in this paper we describe how we can use SPARQL-DL to detect them. We conduct some experiments to detect them in a large OWL ontology corpus obtained from the Watson ontology search portal. Our results show that each antipattern needs a specialised detection method

Archivo Digital UPM

SPARQL-based Detection of Antipatterns in OWL Ontologies

Author: Corcho Oscar
Roussey Catherine
Scharffe François
Šváb-Zamazal Ondřej
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2010
Field of study

Ontology anti-patterns are structures that reflect ontology modeling problems because they lead to inconsistencies or to bad reasoning performance. Based on a collection of anti-patterns coming from our experience in ontology engineering projects and bad modeling practices found in the literature, we propose to represent them as SPARQL queries and conduct an experiment to detect them in an ontology corpus obtained from the Watson ontology search portal

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL

Hal-Diderot

Archivo Digital UPM

Antipattern detection in web ontologies: an experiment using SPARQL queries

Author: Bernard Stephan
Corcho Oscar
Roussey Catherine
Scharffe François
Šváb-Zamazal Ondřej
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2012
Field of study

Ontology antipatterns are structures that reflect ontology modelling problems because they lead to inconsistencies, bad reasoning performance or bad formalisation of domain knowledge. We propose four methods for the detection of antipatterns using SPARQL queries.We conduct some experiments to detect antipattern in a corpus of OWL ontologies

Archivo Digital UPM

Learning concise pattern for interlinking with extended version space

Author: Euzenat Jérôme
Fan Zhengjie
Scharffe François
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/08/2014
Field of study

fan2014bInternational audienceMany data sets on the web contain analogous data which represent the same resources in the world, so it is helpful to interlink different data sets for sharing information. However, finding correct links is very challenging because there are many instances to compare. In this paper, an interlinking method is proposed to interlink instances across different data sets. The input is class correspondences, property correspondences and a set of sample links that are assessed by users as either "positive" or "negative". We apply a machine learning method, Version Space, in order to construct a classifier, which is called interlinking pattern, that can justify correct links and incorrect links for both data sets. We improve the learning method so that it resolves the no-conjunctive-pattern problem. We call it Extended Version Space. Experiments confirm that our method with only 1% of sample links already reaches a high F-measure (around 0.96-0.99). The F-measure quickly converges, being improved by nearly 10% than other comparable approaches

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Specification of the delivery alignment format

Author: Euzenat Jérôme
Scharffe François
Serafini Luciano
Publication venue: HAL CCSD
Publication date: 30/12/2005
Field of study

euzenat2005hThis deliverable focusses on the definition of a delivery alignment format for tools producing alignments (mapping tools). It considers the many formats that are currently available for expressing alignments and evaluate them with regard to criteria that such formats would satisfy. It then proposes some improvements in order to produce a format satisfying more needs

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-Rennes 1